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ABSTRACT 

Background: Stuttering is a speech disorder that affects more than 70 million people worldwide, limiting their ability 
to communicate and socialize. In recent decades, several studies have demonstrated a link between stuttering and 
abnormal Electroencephalographic (EEG) B-power in cortex during speech preparation and production. 

Aim: This study investigated the efficacy of a novel auditory neuromodulating technology that leverages euphonic 
music tracks with broad-spectrum binaural beats to induce selective EEG spectral power changes. 

Methods: Adults with Stuttering (AWS, n=6) and participants from the control group without speech pathologies 
(n=6) were exposed to binaural stimuli for 5 minutes. EEG and Electrocardiographic (ECG) bio-signals were recorded 
before, during, and after exposure. 

Results: During standard reading tasks without stimulation, in controls but not in the AWS group, B-power was 
significantly higher in the left hemisphere than in the right hemisphere. After stimulation, the power of the B-band 
in AWS participants in the left hemisphere increased 1.54-fold, while changes in the right hemisphere activity were 
not significant. The average B-band power within the left front temporal area and temporoparietal junction of the 
cortex after stimulation in AWS participants shows an increase by 1.65-fold and 1.72-fold respectively. The rate of 
disfluency dropped significantly immediately after stimulation (median 74.70% of the baseline rate), but the effect 
was not significantly different from the baseline 10 min later. Similarly, the speech rate significantly increased 
immediately after stimulation (median 133.15%) but it was not significantly different 10 min later. Furthermore, we 
found significant correlations of experimental boost of B-power in left temporoparietal projection (Spearman p=-0.54) 
and left front temporal area (Spearman p=-0.58) with disfluency rate of speech. 

Conclusion: We show for the first time that auditory binaural beat stimulation can substantially improve speech 
fluency in AWS, and its effect is proportional to boost in EEG fB-band power in left front temporal and 
temporoparietal junction of cortex. Changes in B power were detected immediately after exposure and persisted for 
10 min. Additionally, these effects were accompanied by a reduction in stress levels, as monitored by ECG markers. 
This suggests that auditory binaural beat stimulation temporarily improves speech quality in AWS by increasing the 
EEG B-band power. 
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INTRODUCTION 


Stuttering is a speech fluency disorder ICD-11, World Health 
Organization, 2019 that is usually diagnosed in infancy, 
childhood or adolescence. Stuttering involves impairment in 
speech fluency and is characterized by frequent repetitions on 
prolongations of sounds or syllables. Even though 80% of 
children recover, both with natural development and therapy, 
the remaining 20% continue to stutter in adulthood, which 
accounts for 1% of the adult population [1]. The causes of 
stuttering are still poorly understood but are usually attributed 
to speech motor control disruption due to abnormal electrical 
activity of the brain, anatomical abnormalities, genetic and 
psychological 
developmental delay. Neurophysiological studies have offered 


molecular factors, stress or trauma, or 
the promise of actionable discoveries regarding stuttering causes. 
Neuronal electrical activity of the brain is conventionally 
separated into several frequency bands: 6 (0.5 Hz-4 Hz), 0 (4 
Hz-7 Hz), a (7 Hz-13 Hz), and B (13 Hz-30 Hz). Several studies 
have shown that activity in the Electroencephalographic (EEG) 
B spectrum range, which is commonly heightened in the awake 
state and focused activities is reduced in individuals with 
stuttering. EEG and Magneto Encephalographic (MEG) studies 
have shown that in Adults Who Stutter (AWS), compared to the 
control group, a power is higher while B power is lower during 
speech production. This may be due to the role of the B-band in 
coordinating speech planning and execution within the speech- 
production brain network, as it is known that P-oscillations 
become more coherent between the bilateral primary motor, 
premotor, and auditory cortices before speech production. 
Additionally, EEG and MEG studies suggest that coordination 
between auditory clues and motor excitation, which is carried 
out through B oscillations is affected in AWS. 


Mersov, et al., studied neural oscillations in the speech motor 
network during preparation and execution of planned speech 
production in AWS using Magneto Encephalography (MEG). 
Compared to controls, AWS showed stronger B suppression in 
the speech preparation stage, followed by stronger B 
synchronization in the bilateral mouth motor cortex. AWS also 
recruited the right mouth motor cortex significantly earlier in 
the speech preparation stage compared to controls [2-7]. 
Exaggerated motor preparation is believed to be linked to 
reduced coordination in the speech motor network in AWS. 
researched EEG during the spontaneous pre-speech preparation 
process in AWS. But, they found that stuttering severity is linked 
to a stronger EEG B-power during speech intention. The more 
severely a person stuttered, the stronger the alpha power in 
bilateral frontal, and low B power in the mid anterior and right 
central EEG sensors. AWS with more severe stuttering seem to 
show stronger maintenance of the current cognitive or 
sensorimotor state, as stuttering severity was associated with 
increased beta power. Thus, findings are differing from 
described in Mersov, et al. and might be explained by differences 
in study designs and with regard to this study by another 
cognitive process during speech planning. Such differences in 
results suggest that B-power suppression occurs differently in 
AWS with different severity of stuttering, not only during pre- 
speech formation, but also during the pronunciation process [8]. 
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A complex pattern of sympathetic-parasympathetic balance 
emerges in literature on speech-related stress. On one hand, 
speech production and especially speech production in AWS is 
correlated with higher sympathetic activation. On the other 
hand, AWS tends to develop a compensatory mechanism by 
increasing parasympathetic activation and reducing heart rate, a 
phenomenon termed an autonomous coactivation of ANS [9]. 
The effect of sympathetic activation reduction by relaxation 
auditory stimulation of binaural beats with frequency carrier 
components in @-range of brain activity could accompany the 
parasympathetic compensatory mechanisms. Thus we 
hypothesize that reduction of sympathetic activation and 
parasympathetic activation may work quite similarly to internal 
compensation stress reduction mechanisms in AWS and should 


help to alleviate the severity of stuttering symptoms [10]. 


Several methods have been developed before to entrain the 
brain oscillations. Synchronization of brain activity at the 
frequency of sensory stimulation and its harmonics in and 
beyond the corresponding sensory brain areas has been achieved 
with visual flicker, somatosensory tactile stimulation and 
auditory stimulation [11]. Popular approaches for auditory brain 
entrainment include isochronous sounds, monaural beats and 
binaural beats. Stuttering reduction using noise and auditory 
voice feedback treatments has been explored for several decades. 
The feasibility of entraining the B-band by isochronous sound 
exposure has been demonstrated in healthy adults and in AWS 
[12]. 


To define the regions of interest within the cortex we noted that 
earlier studies found that brain activity EEG patterns normally 
seen during speech production in non-stutterers were either 
absent, bilateral in nature, or lateralized to the left hemisphere. 
while in AWS compensation effects were found in the right 
hemisphere. Increased gray matter volume in right hemisphere 
motor auditory homologues coupled with increased white 
matter density in the fiber tracts connecting them have been 
interpreted as evidence of right hemisphere compensation [13]. 
B-band activation in the premotor cortex plays the important 
role of fluent speech production. 


Given earlier results suggesting a possible connection between 
stuttering and the deficit in spectral power of the B EEG band 
and the lack of activation in speech production centers of the 
brain, we hypothesized that an increase in B-power with binaural 
beat stimulation might temporarily alleviate disfluency episodes 
and improve speech quality in AWS [14]. 


To test our hypothesis, we exposed people with Stuttering (AWS) 
and fluent individuals (controls) to proprietary binaural beat 
soundtracks. Simultaneously, we recorded the EEG, ECG, and 
speech production. To reveal the influence of auditory 
stimulation on electrophysiological brain activity, we analyzed 
the whole EEG spectrum and performed separate analysis for 
the electrodes group located over the Left Frontotemporal (LFT) 
region and Temporoparietal junction (LTP) of the cortex. As 
speech production may be linked to changes in activation of 
sympathetic ANS, we monitor sympathetic activation in 
response to auditory stimulation using Heart Rate Variability 


(HRV). 
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MATERIALS AND METHODS 


Experimental design 


Adult volunteers with stuttering (AWS, 6 males; aged 18 years-38 
years; mean age: 28.3 + 7.1 years) and fluently speaking 
volunteers (controls, 6 males; 26 years-52 years old; mean age: 
32.0 + 10.1 years) signed informed consent and participated in 
the study at the laboratory of faculty of physics, electronics and 
computer systems of Dnipro national university. The study was 
approved by the ethical commission of Dnipro national 
university, protocol #15 from 2021-05-17. All the participants 
were right-handed and had no history or complaints of hearing 
difficulties. The stuttering participants received no medications 
or stuttering therapy during the experiments. None of the 
control participants reported a history of speech, language, or 
hearing difficulties. The experimental protocol consisted of six 
stages, 5 min each (Figure 1A). 


¢ Baseline exposure under relaxation and with eyes closed, 

¢ Reading activity pre-stimulation, 

¢ Auditory stimulation, 

¢ Reading task activity immediately after the stimulation, 

¢ Reading task activity after 10 min of relaxation, and 

* Resting state measurement with eyes closed. During each 
stage, the EEG and ECG signals were continuously recorded. 


The reading aloud was assayed in the participants' native 
language (Russian) for five minutes. Voice was recorded using a 
builtin microphone in a MacBook Pro 13" (2015) laptop. 


Auditory stimulation 


An auditory euphonic Binaural Beat (BB) stimulus with three 
spectral components (6, a, and 8) was prepared using a custom 
generator and post processing algorithms United Kingdom 
patent application No. 2116927.1. Ambient royalty-free music 
with vocals was used as a carrier to specifically target the brain 
cortical centers responsible for voice recognition (audio 
recording can be found in supplementary materials in the folder 
“stimulus”). To produce a euphonic binaural stimulus, we 
applied a low-pass filter with a cutoff of 170 Hz, which was 
derived empirically for the chosen music background (to avoid 
reverberations of sound). The resulting track was normalized in 
the final stage, and raw stereo audio data were generated. The 
stimulus used in this study contained binaural components at 3 
Hz, 7 Hz and 21 Hz produced by frequency shift between left 
and right channel with a low-pass cut-off of the binaural effect at 
170 Hz (supplementary Figures 1A and 1B). The spectral 
characteristics of the resulting auditory output stimuli are 
shown in supplementary Figure 1C. The output Sound 
Pressure Level (SPL) of the stimulus was normalized to the 73 
dB level to match the optimal level of auditory binaural 
stimulation and volume was kept at constant level throughout 
the stimulation stage. The auditory stimulus was delivered using 
high-quality wireless sennheiser momentum true wireless 2 
headphones with an output frequency response band in the 
range of 5 Hz to 33,000 Hz. All participants confirmed that the 
used SPL would not distract them and described the sound level 
to be comfortable (Figure 1). The headphones were worn by 
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participants during the whole experiment, but the stimulus was 
played only during the stimulation phase [15]. 
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Figure 1: The design of the study. A) Scheme representing the 
design of the experiments; B) Location of EEG electrodes (32 in 
total) used during all experiments with highlighted groups of 
electrodes over the left temporoparietal junction (green on 
upper topography) and left frontolateral area (blue on upper 
topography) and groups of electrodes used for the whole- 
hemisphere analysis (bottom topography, blue for the left and 
green for the right hemisphere). (EEG) 
Electroencephalographic. 


EEG registration and signal processing 


A medical-grade digital brain electric activity monitor (contec 
KT88-3200, S/N 591012399249) was used to collect raw EEG 
data using conventional wet cap electrodes. Raw data were 
collected using the proprietary companion software tool contec 
EEG32. The raw output data were saved in the European EDF+ 
format. We used 10/20 EEG electrode placements according to 
the international standard system Acharya, et al. with reference 
electrodes Al and A2 (the electrodes configuration is shown in 
Figure 1B). 


The sampling rate for each channel was 200 Hz and the 
analogue front-end of the EEG device has 0.1 pV resolution and 
a 0.016 Hz-1000 Hz input signal frequency range. The input data 
were digitally filtered with a f=0.1-30 Hz band pass filter in order 
to extract useful EEG signals and exclude Electromyography 
(EMG) artifacts. Brain electrical activity was post-processed using 
Fast Fourier Transform (FFT) followed by Power Spectrum 
Density (PSD) [16]. The from Electromagnetic 
Interference (EMI) was reduced by applying a notch filter with a 
cut-off frequency of 50 Hz. Episodes of signals with bad contact 
of electrodes were excluded from the record by the automatic 
built-in lead-off detection function of the EEG monitor. 
Additionally, EOG artifacts were extracted using Independent 
Component Analysis (ICA) infomax decomposition algorithm. 
The absolute power was numerically integrated over each 
frequency of the following bands: 6 (0.5 Hz- 4 Hz), 8 (4 Hz- 7 
Hz), a (7 Hz- 13 Hz), and B (13 Hz- 30 Hz) over each electrode's 
position (32-electrode EEG measurement). Consequently, an 
average of 32 channels was used for the statistical analysis of the 
spectral power. The spatial distribution of the PSD calculated 
using a standard formula was visualized as a heatmap for each 
EEG spectral band. To obtain the power of electrical brain 
activity in areas associated with speech production processes, we 
calculated the mean average power at the left temporoparietal 
junction (anatomically associated with Wernicke's area) within 


noise 
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electrodes C3, CP3, P3, P7, TP7, T7, and the left 
frontotemporal area within electrodes F3, FC3, F7, and FT7 
(anatomically area) (Figure 2). 
Topographies, compressed spectrum graphs, and trend graphs 
were produced using open-source EEGLAB v.2021.1 software 
(Figure 3). 


associated with Broca's 


if 
Figure 2: ECG-based markers of stress during reading tests. 
A) HRV markers (meanRR, heart rate, stress index, RMSSD, 
LF, HF) are shown for AWS and control group in baseline rest 
state, reading before the stimulation, immediately after (0 
min), and 10 min after the binaural beats stimulation. 
RR intervals deviationwith HRV 
characteristics before stimulation; B) Right after stimulation; 


time demonstrate 
C) And 10 min after stimulation; D) Note an increase in 
amplitude and higher frequencies alongside with reduction in 
regularity of oscillations exemplifying physiologic relaxation. 
(ECG) Electrocardiography,((HRV) Heart Rate Variability, 
(AWS) Adults with stuttering,(RMSSD) Root Mean Square of 
Successive Differences, (LF/HF) Low-to-high Frequency ratio. 
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Figure 3: EEG $-power change at the different stages of experiment. 
A) EEG f-waves in reading experiments in AWS (top) and control 
group (bottom) averaged across all 32 electrodes; B) Power in f- 
band for left and right hemispheres (left and right, respectively) 
during the reading activities in AWS and control; C) Power in f- 
band averaged across electrode groups located over the LTP junction 
(left) and LFT area during the reading activities in AWS and 
controls. (EEG) Electroencephalography, (AWS) Adults With 
Stuttering, (LTP) Left Temporoparietal junction, (LFT) Left 
Frontotemporal area. 
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ECG registration and signal processing 


Electrocardiographic (ECG) recordings were conducted using a 
contec 8000GW device (HW S/N: 39124881923) with four wet 
electrodes placed on the limbs to obtain a conventional six-lead 
ECG system (I, II, III, avL, avR, and avF). The input analog 
ECG signal was amplified with 1000x gain by a low-noise analog 
frontend. To remove artifacts, ECG-signal segments with poor 
contact of electrodes were detected and removed using builtin 
AC-current lead-off detection [17]. The input signal was low-pass 
filtered with a Butterworth IIR filter of order n=8 and cutoff 
frequency f=30 Hz to 
Electromyographic (EMG) artifacts. Baseline drift was removed 
with a high-pass filter Butterworth IIR filter of order n=8 and 
cutt-off frequency f=0.67 Hz. The sampling frequency was 1000 


prevent aliasing and exclude 


Hz, while passband attenuation was taken at 1 dB. The 
influence of EMI was reduced by applying a notch filter with a 
cut-off frequency of 50 Hz. Raw ECG data were exported to 
HL7 aECG standard XML format (Figure 4). 


Figure 4: Topography and frequency specificity of binaural beats 
stimulation-induced changes. A) Topographical distribution of B- 
power before, after and 10 minutes after the stimulation; B) 
Coherence between LTP junction and LFT area during speech 
in AWS, before, immediately after and 10 min after stimulation; 
C) Coherence between LTP junction and LFT area during 


speech in controls, before, immediately after and 10 min after 


stimulation. (AWS) Adults With Stuttering, (LTP) Left 


Temporoparietal junction, (LFT) Left Frontotemporal area. 


The state of the Autonomic Nervous System (ANS) and stress 
level were assessed by Heart Rate Variability (HRV) using a set of 
standard metrics calculated from ECG RR-intervals (heart beat- 
to-beat intervals) forth by the European society of cardiology and 
North American society HRV standards (“heart rate variability,” 
1996): MeanRR, mean heart rate, standard deviation of NN 
intervals (SDNN), Root Mean Square of Successive Differences 
(RMSSD), (LF/HF), Low 
Frequencies power (LF), High Frequencies power (HF), and 
Baevsky stress index. The Baevsky stress index (SIdx), which is a 


Low-To-High Frequency ratio 
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measure of HRV reflecting cardiovascular system stress. High 
values of SI indicate reduced variability and high sympathetic 
cardiac activation. Sldx was computed according to the formula 
(Baevsky RM; Berseneva AP, n.d.); 


_— wm_Alo~x 100% 
SIdx = 2 Mo xX MxDMn’? 


Where; 
AMo is the mode amplitude presented in percentage points, 
Mo is the mode (the most frequent RR interval), 


MxDMn is the variation scope reflecting the degree of RR 
interval variability. 


Mode Mo was simply calculated as the median of the RR 
intervals. The AMo was obtained as the height of the 
normalized RR interval histogram (bin width 50 ms) and 
MxDM)n as the difference between the longest and shortest RR 
interval values. To make SIdX sensitive to slow changes in 
mean heart rate (which would increase the MxDMn and lower 
AMo), the very low frequency trend is removed from the RR 
interval time series using the smoothness priors method (Figure 


5). 


speech rate during 


Figure 5: Speech disfluency and 


experiments. A) Speech disfluency rate before, after and 10 
min after binaural beats stimulation; B) Speech rate before, 
after and 10 min after binaural beats stimulation. 


Speech quality assessment 


Speech recordings were evaluated by speech therapists for a 
range of disfluencies, such as repetitions of syllables, whole 
words, sound prolongations, and blocks. Each audio recording 
was scored blindly by a speech therapist (a participant ID was 
provided but category). 
Disfluencies were measured in terms of the number of 


neither experimental stage nor 


repetitions (separately for sounds, syllables, and words), 
prolongations, and blocks. The total number of disfluencies per 
experiment was normalized to the total number of words read 
[18]. As the texts differed from one stage of the experiment to 
the other (each time a new unfamiliar text was proposed for 
reading to exclude memorization effect), it was necessary to 
calculate the relative number of disfluencies for each 
measurement. The same set of texts was provided to all 
participants at respective stages of the experiment. Speech rate 
was estimated as words read per minute. At the end of the 
experiment, participants rated the pleasantness of auditory 
stimulation and its effect on speech abilities as well as their 
overall stress before and after stimulation with available options: 
Easier to speak after stimulation; no change; harder to speak 


after stimulation. 


J Clin Res Bioeth, Vol.14 Iss.3 No:1000464 


OPEN a ACCESS Freely available online 


Statistical analysis 


To examine the effect of treatment for each cohort (AWS, 
controls) on physiological readouts while accounting for 
repeated experiments within participants, we applied Linear 
Mixed effects Models (LMM). Every set with physiological raw 
data was processed and scored blindly by automated script 
written in python programming language (Figure 6). 


of eewencpwees gece) WV a naten Oe torres yere wee 


Figure 6: Correlation between speech disfluencies and B-power 
over the speech-relevant centers after beats stimulation (0 min 
and 10 minutes after). A) Correlation between disfluency rate 


per AWS participants and B-band power average electrodes over 
the LFT area. (AWS) Adults With Stuttering, (LTP) Left 


Temporoparietal junction, (LFT) Left Frontotemporal area. 


All statistical analyses were done using the R programming 
language. We researched the relationships between changes in 
spectrum power of brain cortex electrical activity after 
stimulation and disfluency rate of speech. To assess relationships 
across these variables, non-parametric tests (i.e., spearman’s rank) 
were used where variables were not normally distributed [19]. 
The effects due to participants’ individual physiology were 
treated as a varying intercept random effect, effects due to the 
experiment were treated as random effects nested within the 
participants’ effects, and effects due to the cohort or treatment 
were treated as fixed effects. Analysis was performed using Ime4 
and Imer test libraries in R software, and the significance of the 
fixed effects was reported. Speech characteristics were compared 
using the wilcoxon ttest for a single (first) experiment per 
participant [20]. 


RESULTS 


Auditory stimulation results in reduction of stress 
level as monitored by ECG 


To assess the reaction of AWS and controls on reading tasks we 
compare HRV parameters in the resting state and during the 
initial reading. The reaction of AWS and controls was different. 
In controls unlike AWS, heart rate and LF power during the 
reading increased relative to the baseline resting state (by 1.04 + 
0.06 p=0.03 in wilcoxon test and 2.51 + 1.69 p=0.03 fold, 
respectively, in wilcoxon test). Next, we evaluated the effect of 
auditory stimulation on general stress level, heart rate, and HRV 
parameters to elucidate ANS state. A significant decrease was 
observed in the heart rate immediately after stimulation 
compared to the baseline (p=0.024 in wilcoxon test). After 10 
min, the heart rate stayed significantly lower only in the control 
group (p=0.01 in wilcoxon test). Moreover, the Baevsky's stress 
index was down in both AWS and control group relative to the 
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baseline both immediately after the exposure (by 1.16 + 
0.12 p=0.01 in wilcoxon test and 1.26 + 0.19 fold, respectively, 
p=0.03 in wilcoxon test) and 10 min later (p=0.01 and 
p=0.03 in wilcoxon test, Figure 2A). No © significant 
difference was found in the HRV RMSSD after stimulation. 
An example of heart rate modulation during the experiment 
is shown by an ECG series for one AWS participant before, 
immediately after, and 10 min after stimulation (Figures 2B-2D). 


AWS display distinct spatial signatures of brain 
activation during speech production 


To evaluate the baseline electrophysiological differences between 
AWS and controls, we measured EEG characteristics for both 
groups. No significant difference in B power averaged across all 
electrodes was found between the control and AWS cohorts in 
the baseline rest state (p=0.43 in Wilcoxon test, supplementary 
Figures 2A and 2B). The total power in the B-band in the control 
group was significantly higher in the left hemisphere than in the 
right hemisphere (p=0.03 in wilcoxon test) during the baseline 
state measurements. However, electrical activity in the AWS 
during the resting state in the left and right hemispheres was the 
same (p=0.38 in wilcoxon test). 


Next, during the reading task in the baseline state, the average 
power of the B-band did not differ significantly between the AWS 
and control groups (p=1.0 test). Left-right 
hemispheric asymmetry in B-band power was observed in the 
control group (p=0.03 in wilcoxon test) but not in the AWS 
group (p=0.74 in wilcoxon test). Detailed results of the 
hemispheres electrical activity distribution during the experiment 


in wilcoxon 


are shown in (supplementary Figure 2A and 2B, supplementary 
Table 1). 


The B power in LTP junction and LFT cortex area during the 
reading task did not significantly increase neither in AWS (p=0.64 
in wilcoxon test, p=0.54 in wilcoxon test) nor in the control 
group (p=0.06 in wilcoxon test in controls, p=0.15 in wilcoxon 
test in controls) relative to the baseline rest state. However, the 
relative average power of the B-band in the LTP junction and LFT 
area correlated across participants in the control group (spearman 
p=0.89, p=0.03, supplementary Figure 3) and AWS group 
(spearman p=0.9, p=0.05, supplementary Figure 4). 


Auditory stimulation enhances B spectral power and 
leads to spatial redistribution of brain activity 


To assess the effectiveness of the auditory binaural beat 
stimulation in enhancing modulated frequencies of brain 
electrical activity, we recorded and analyzed EEG activity before, 
during (supplementary Figure 5), and after stimulation, at rest, 
and during reading activity, as shown in Figure 1, and analyzed 
the overall power of the B band, as well as each hemisphere, and 
within cortex projections related to the speech production. 


The overall absolute B power in the control group was 
insignificantly increased by 1.4-fold after stimulation (p=0.16 in 
wilcoxon test) and stayed at 1.2 fold 10 min after compared to 
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the baseline reading (p=0.56, wilcoxon test). The overall B-power 
in the AWS increased by 1.32-fold 


stimulation (p=0.15 in wilcoxon test) and remained at 1.14 fold 


insignificantly after 
10 min after stimulation (p=1.0 in wilcoxon test; Figure 3A) 
compared to the baseline reading. 


Detailed analysis of spatial distribution showed that after 
stimulation, the power of the B band in AWS participants in the 
left hemisphere increased 1.63 fold (with p=0.01 in wilcoxon 
test), while changes in the right hemisphere activity were not 
significant (p=0.74 in wilcoxon test, Figure 3B). In the control 
group, no significant changes in B power were observed in either 
brain hemisphere after stimulation (p=0.44 in wilcoxon test for 
left; p=0.09 in wilcoxon test for right hemisphere, Figure 3B). 
Spatial changes were visualized as differences in the topography 
of grand-averaged normalized B brain activity during text reading 
after stimulation (Figure 4A). 


Next, we analyzed the average B-band power within the LFT and 
LTP junctions. After the stimulation, the average power of the B- 
band within LTP projection in AWS participants increased by 
1.74 fold after stimulation (p=0.03 in wilcoxon test) and stayed at 
1.65 fold 10 min after stimulation (p=0.15 in wilcoxon test 
compared to the baseline). After stimulation, the average power 
of the B-band within the LFT area in AWS participants increased 
by 1.72 fold after stimulation (p=0.01 in wilcoxon test) and 
stayed at 1.61 fold 10 min after stimulation (p=0.84 in wilcoxon 
test compared to the baseline). Coherence analysis between the 
electrodes electrical activity associated with LFT area and LTP 
junction in AWS cohort shown that average coherence within B 
band before stimulation was equal to 0.628+ 0.009 (I=64.44, 
Ibeta=7.54), after stimulation 0.66 + 0.01 (I=60.73, Ibeta=7.91) 
and 10 min after 0.606 + 0.01 (I=62.91, Ibeta=7.27). At the same 
time, in the control group the coherence within B band before 
stimulation was 0.21 + 0.01 ([=21.29, Ibeta=2.61), after 
stimulation 0.23 + 0.01 (1=32.23, Ibeta=2.75), and after the next 
10 min 0.1 + 0.01 (I[=22.64, Ibeta=1.24) (Figure 4D). 
Additionally, we observed a cross-sectional correlation between 
the activity of the two areas in both AWS (supplementary Figure 
6A) and control (supplementary Figure 6B) groups immediately 
after the stimulation (Spearman p=1.0, p=5e-5 at AWS and 
p=0.94, p=0.01, in controls) and 10 min later (Spearman p=0.86, 
p=0.01, in AWS and p=0.97, p=0.001, in controls, respectively). 
Cross-sectional correlations for each patient are shown in 
supplementary Figure 7. 


Auditory stimulation leads to improvement of 
speech quality and conditional f-activation in 
speech centers in AWS. 


To investigate the effect of auditory stimulation on speech 
quality, we recorded speech samples before and after the 
stimulation (Figure 1). The speech rate (words per minute) and 
rate of disfluencies (the ratio of disfluencies to the total number 
of words uttered) were counted in the AWS group by a speech 
pathologist. The rate of disfluencies dropped after the 
stimulation (median 74.70% of the baseline rate with 90% CI: 
37.11%-98.16%, p=0.04 in paired wilcoxon test, Figure 5A), but 
there was no significant difference from the baseline 10 min 
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later (median 65.51% of the baseline rate, 90% CI: 39.15%- 
93.70%, p=0.15; Figure 5B). Similarly, speech rate increased 
immediately after stimulation (median 133.15% of the baseline 
rate, p=0.04), but was not significantly different 10 min later 
(median 126.63% of the baseline rate, p=0.08). Immediately 
after stimulation, six out of the six participants noted the 
subjective ease of reading the text aloud (short questionnaire 
described in methods). Speech recordings showed a smoother, 
clearer, and faster text pronunciation for AWS after stimulation 
(see the link on records in supplementary materials). 


Improvement of speech quality is correlated to the 
specific brain activity patterns 


As speech improvement is expected to be mediated by activation 
of the LFT area and LTP junction, we analyzed the correlation 
between changes in the rate of disfluencies and activation in the 
respective areas. To elucidate the contribution of auditory 
stimulation-induced changes in electrical activity to changes in 
speech fluency, we compared changes in B band power in the 
LFT and LTP junction before and after stimulation to the 
disfluency rate. Improvement in fluency after stimulation and 
10 min after compared to the reference reading stage correlated 
with an increase in the power of the B band in both LTP 
junctions (spearman p=-0.54, p=0.03, Figure 6A) and LFT area 
(spearman p=-0.58, p=0.02, Figure 6B). 


When comparing speech fluency between the AWS participants 
who responded with an immediate increase in B-power within 
the speech centers upon auditory stimulation (n=4, average 
increase by 2.3 fold) to the rest of the AWS participants (n=2), 
the former displayed an improvement in speech fluency both 
immediately (30.03%, p=0.03 in the paired wilcoxon test) and 
after 10 min (25.38%, p=0.03 the paired wilcoxon test). 
Additionally, in the responder sub-group, the increase in B 
power persisted after 10 min (1.77 fold relative to the baseline 
reading stage, p=0.03 in the paired student's t-test, p=0.05 in the 
paired wilcoxon test). This suggests that improvement in speech 
quality may be mediated by activation of the B-frequency band 
in speech centers, and the change in spectral power of the B- 
band may serve as a surrogate variable predictive of functional 
response in speech quality. 


DISCUSSION 


In this study, we assessed the responses of adults with and 
without stuttering to auditory stimulation with binaural beats 
using a broad range of physiological metrics. 


A hypothesis that drove our efforts stated that participants with 
stuttering may exhibit lower B-power than controls. We were 
unable to confirm this hypothesis in our cohort, potentially due 
to the low number of participants. However, we observed 
hemispheric asymmetry in f activity in controls but not in AWS, 
which is in agreement with observations in children who stutter. 
Next, we were able to induce an increase in B-power during 
reading activity in AWS but not in controls by means of 
auditory binaural beat stimulation. This did not create a 
significant left-right asymmetry in participants with stuttering. 
However, the increase in B-power in the AWS was significant 
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only in the left hemisphere. This is a desirable effect, as the 
speech centers localize to the LTP junction and LFT cortex area. 
Furthermore, we observed that upon auditory stimulation, the B- 
power increased in left projections in AWS but not in controls. 


Next, we demonstrated that auditory stimulation improved 
speech quality. The effectiveness of B-range BB stimulation in 
improving speech quality in AWS agrees with earlier studies, 
suggesting a link between B deficit and stuttering. Moreover, our 
results suggest that there is a dose-effect relationship between 
speech improvement and the increase in EEG B-power. As a 
further step, we need to explore the effect of stimulation on a 
larger group of AWS with a longer follow-up period. 


Stuttering episodes are often accompanied by stress and anxiety 
which is another common target of stuttering therapy. In this 
study, we probed stress levels by monitoring participants’ heart 
rate and Heart Rate Variability (HRV) by ECG RR intervals. We 
show that the response to speech related stress in AWS and 
controls is different. During the reading task controls show 
significant increase in heart rate and sympathetic activation 
(increasing of LF spectrum power), while AWS do not. This 
substantiates the existence of compensatory mechanisms in 
AWS. Additionally, our stimulation reduced HR and increased 
HRV in AWS. We observed a significant drop in the stress index 
after exposure to the stimuli in participants with stuttering and 
controls, suggesting a reduction in stress levels. This is in 
agreement with earlier work showing that brain waves can be 
entrained with BB which may cascade into other physiological 
functions, such as HRV. 


As a further step, we plan to reproduce results with sham audio 
stimuli without BB and study the effect of both amplitude and 
exact frequency of binaural shift in Brange of brain waves 
(currently set to 21 Hz) on fluency improvements. Other tone 
frequencies should be further investigated because the optimal 
resonant frequency can be individualized for different patients. 
The hypothesis of calibration of the modulated frequency may 
depend on individual brain electrical activity and may be 
reflected in speech phonetic patterns. Cross-over design and 
placebo trials of the experiment should be conducted to clarify 
the effects of the stimulus that do not depend on groups of 
participants. The longterm effects of exposure, including 
possible habituation or long-term exposure, as well as the impact 
should be — further 


investigated. The duration and effect of carrier music 


on participants' general well-being, 
parameters, as well as variations in the binaural shift frequency 
on the effect duration, may be investigated in more detail in 


future research. 


No adverse effects were observed in this study. The participants 
did not experience negative or irritating effects or fatigue after 5- 
minutes of BB stimulation. Thus, longer stimulation may also 
be considered because of the non-invasiveness and absence of 
side effects of the proposed approach. In addition, such an 
auditory entertainment approach can be used in a multi-day 
course with several sessions. 


An in-depth understanding of effects of the auditory BB 
stimulation and its capability to temporarily reduce stuttering 
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symptoms may be useful for end-user applications. The findings 
of this work may be used for design of digital therapeutic 
techniques for the treatment of speech disfluencies. 
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